Deep Learning Systems by Andres Rodriguez
Author: Andres Rodriguez
Language: eng
Format: epub
Publisher: Morgan & Claypool Publishers
Published: 2020-02-14T22:00:00+00:00
This algorithm determines the layers that can be quantized. One challenge is that interleaving layers that use large and small numerical formats may increase the computational cost because of the overhead of the many format conversions.
Cross-layer range equalization is a data-free quantization technique (it requires no data and no back-propagation). The range of the weights is equalized across the layers, and the range of the activations is constrained, under the assumption that a piece-wise linear activation function (such as ReLU) is used between the layers [NvB+19]. This assumption is satisfied by many CNN models but not by non-CNN models. This technique is used in the Qualcomm Neural Processing SDK.
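The equalization step can be sketched for two fully connected layers with a ReLU between them. This is a minimal illustration, not the SDK's implementation: the function names are hypothetical, and the per-channel scale s_i = sqrt(r1_i * r2_i) / r2_i (with r1_i and r2_i the weight ranges of channel i in each layer) is assumed here as the usual statement of the rule in [NvB+19]. It works because ReLU(x / s) * s = ReLU(x) for any s > 0, so the rescaling leaves the network output unchanged.

```python
def relu(v):
    return [max(0.0, x) for x in v]

def matvec(W, x):
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def forward(W1, b1, W2, x):
    """Two-layer network: W2 @ relu(W1 @ x + b1)."""
    hidden = [h + b for h, b in zip(matvec(W1, x), b1)]
    return matvec(W2, relu(hidden))

def equalize(W1, b1, W2):
    """Rescale each hidden channel so both layers see the same weight range.

    W1 rows are the output channels of layer 1; W2 columns are the matching
    input channels of layer 2. Assumes every channel has a nonzero range.
    """
    n = len(W1)
    r1 = [max(abs(w) for w in W1[i]) for i in range(n)]
    r2 = [max(abs(row[i]) for row in W2) for i in range(n)]
    s = [(r1[i] * r2[i]) ** 0.5 / r2[i] for i in range(n)]  # assumed scale rule
    W1_eq = [[w / s[i] for w in W1[i]] for i in range(n)]
    b1_eq = [b1[i] / s[i] for i in range(n)]
    W2_eq = [[row[i] * s[i] for i in range(n)] for row in W2]
    return W1_eq, b1_eq, W2_eq

W1 = [[4.0, -8.0], [0.1, 0.2]]
b1 = [1.0, -0.5]
W2 = [[0.5, 2.0], [-0.25, 4.0]]
W1_eq, b1_eq, W2_eq = equalize(W1, b1, W2)
```

After the call, channel 0 has a weight range of 2 in both layers (it was 8 in the first and 0.5 in the second), so a single quantization factor per tensor wastes less of the int8 range.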
Channel-wise quantization uses a quantization factor for each channel rather than one factor for the entire tensor.
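To see why a per-channel factor helps, consider a weight tensor whose channels have very different ranges. The sketch below is illustrative only: the helper names are hypothetical, and it assumes symmetric int8 quantization with the factor chosen as 127 divided by the largest absolute value.

```python
def quantize(values, factor):
    """Multiply by the quantization factor, round, and clamp to int8."""
    return [max(-127, min(127, round(v * factor))) for v in values]

def dequantize(codes, factor):
    return [c / factor for c in codes]

def factor_for(values):
    # Guard against an all-zero channel (assumption: fall back to 1.0).
    return 127 / (max(abs(v) for v in values) or 1.0)

def per_tensor(weights):
    """One factor shared by every channel."""
    factor = factor_for([v for channel in weights for v in channel])
    return [dequantize(quantize(ch, factor), factor) for ch in weights]

def per_channel(weights):
    """A separate factor for each channel."""
    out = []
    for ch in weights:
        factor = factor_for(ch)
        out.append(dequantize(quantize(ch, factor), factor))
    return out

def max_error(original, restored):
    return max(abs(a - b) for a, b in zip(original, restored))

weights = [[10.0, -5.0, 2.5], [0.01, -0.02, 0.005]]
err_tensor = max_error(weights[1], per_tensor(weights)[1])
err_channel = max_error(weights[1], per_channel(weights)[1])
```

With a single factor, the small-valued second channel is rounded entirely to zero; with its own factor, it keeps the full int8 resolution.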
Stochastic rounding (rather than nearest-value rounding) after multiplying by the quantization factor can improve performance [WCB+18]. To illustrate, rather than rounding the number 1.2 to the number 1, it is rounded to 1 with 80% probability and to 2 with 20% probability.
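The 1.2 example can be sketched in a few lines. The function name and the use of Python's `random` module are illustrative choices, not taken from [WCB+18]. The key property is that the rounded value equals the input in expectation, whereas nearest-value rounding always maps 1.2 to 1.

```python
import math
import random

def stochastic_round(x, rng=random):
    """Round x up with probability equal to its fractional part, else down."""
    lower = math.floor(x)
    frac = x - lower  # 0.2 for x = 1.2
    return lower + (1 if rng.random() < frac else 0)

rng = random.Random(0)  # fixed seed so the sketch is repeatable
samples = [stochastic_round(1.2, rng) for _ in range(20000)]
mean = sum(samples) / len(samples)  # close to 1.2 on average
```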
Unsigned int8 ReLU activations uses the unsigned int8 representation, rather than signed int8, for the activations of the ReLU functions. Using signed int8 wastes half of the values since all the activations are nonnegative.
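The gain in resolution can be illustrated with a small sketch. The function name is hypothetical, and the signed case is assumed to use a symmetric range, so only the codes 0 to 127 are available for nonnegative activations, compared with 0 to 255 for unsigned int8.

```python
def quantize_relu_activations(acts, signed):
    """Quantize nonnegative activations; returns (codes, step size)."""
    levels = 127 if signed else 255  # largest usable nonnegative code
    top = max(acts) or 1.0           # guard against all-zero input (assumption)
    factor = levels / top
    codes = [min(levels, round(a * factor)) for a in acts]
    return codes, 1.0 / factor

acts = [0.0, 0.5, 1.0, 2.0]  # outputs of a ReLU, so all are >= 0
codes_s, step_s = quantize_relu_activations(acts, signed=True)
codes_u, step_u = quantize_relu_activations(acts, signed=False)
```

For the same activation range, the unsigned step size is roughly half the signed one, which is the half of the values that the signed representation leaves unused.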
The techniques QAT, selective quantization, channel-wise quantization, and stochastic rounding also benefit fp8 [CBG+20].
